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(54) Processing multiple datat>ase transactions in the same process to reduce process overhead 
and redundant retrieval from database servers 



(57) The present invention uses a segmented cach- 
ing data structure to cache database objects prcvkied 
by a database server. The database server provides 
database objects in response to requests by a number 
of different programs. The segmented caching data 
structure is made up of a single central cache and a 
number of program caches, each corresponding to one 
of the programs. When a database object is provided by 
the database server in response to a request by any of 
the programs, a copy of the database object is stored in 



the central cache. Another copy of the object is stored in 
the program cache for the program that requested the 
database object. When the segmented caching data 
structure is maintained in this manner, when a request 
is made by one of the programs a copy of the requested 
object stored in either of the central cache or the pro- 
gram cache for the program may be used, making it 
unnecessary for the database server to provide the 
requested database object. 
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Descrlpll n 

Technical Field 

The invention relates generally to the fields of data- 
base transaction processing and caching the results of 
retrieval requests. 

BackflrevPd of the Invgntlor) 

It is common to use databases to store significant 
quantities of data, especially in cases in which the data 
is shared by a number of database-accessing pro- 
grams. Database-accessing programs that access data 
in a database may be executed on a nunriser of different 
connected computer system. These programs issue a 
series of database transactions, each corresponding to 
one or more operations on the database, including read 
and write operations. 

When two or more such progranns are executed on 
the same computer system, they are typically each exe- 
cuted in a separate process. Each process corresponds 
to a set of resources provided by the operating system, 
most notably an addressable range of memory, that is 
available only to programs ("threads") running within the 
process. Database-accessing programs are generally 
each executed in a separate process to prevent them 
from corrupting each other's data. Because datat>ase- 
accessing programs executing in separate processes 
cannot share data, the results of a read operation 
obtained by one database-accessing program are una- 
vailable to other database-accessing programs that 
issue the same read operations. Indeed, because each 
database-accessing program typically discards the 
results of read operation performed as part of a transac- 
tion when the transaction completes, a single database- 
accessing program may have to issue the same read 
operation two or more times in a short period of time. 
These redundant transactions again must be applied 
directly against the database, which has significant time 
cost. First, the database-accessing program must trans- 
mit the transaction across a network to the computer 
system containing the database, which can take a sub- 
stantial amount of time. Further, the to actually apply the 
transaction against the database, the database-access- 
ing program must obtain the appropriate locks, or 
access controls, on the database, which can involve fur- 
ther network communication and synchronization with 
database-accessing programs executing on still other 
computer systems. 

Further, because each process has an extensive 
set of resources devoted to it. the operations of creating 
and destroying processes each have significant time 
cost. In the conventional approach of executing each 
database-accessing program in a separate process, 
this significant time cost is incurred any time an data- 
base-accessing program begins or ends execution. In 
addition, many of the kinds resources devoted to sepa- 



rate processes are scarce, and allocating shares of 
these kinds of resources to each of a large number of 
processes further degrades the performance of the 
computer system and limits the number of other pro- 
5 grams that can simultaneously execute on the computer 
system. 

Given the significant disadvantages of the conven- 
tional approach to executing database database- 
accessing programs, an alternative approach to execut- 
10 ing database programs that reduces redundant retrieval 
from database servers and reduces process overhead 
would have significant utility. 

SummarvofthB Invention 

75 

In accordance with the present invention a data- 
base object caching facility ("the factlity") maintains a 
hierarchy of caches that enables database-accessing 
programs processing database transactions to share 

20 retrieved database objects across database transac- 
tions, reducing process overhead and redundant 
retrieval from database servers, while still maintaining 
read-repeatable transactional isolation. "Database 
ot^ects" as used herein means any unit of data that may 

25 be retrieved from a database including tables, fields, 
files, programmatic objects, and other units of data. 

The facility executes multiple database-accessing 
programs in the same process, and utilizes a hierarchy 
of caches to cache database objects retrieved from a 

30 database server. The hierarchy of caches includes one 
program cache for each database-accessing program, 
and a single process cache. The facility uses each pro- 
gram cache to store database objects recently retrieved 
by the cache's database-accessing program, and uses 

35 the process cache to store database objects recently 
retrieved by any of the database-accessing programs. 
The program caches each allow a database-accessing 
program to quickly obtain a database object retrieved 
earlier by the same database-accessing program. The 

40 process cache, on the other hand, allows a database- 
accessing program to quickly obtain a database object 
recently retrieved by other database-accessing pro- 
grams. The hierarchy of caches may optionally include 
additional caches. 

45 

Brief Pe scripti on of t h e Praw inqs 

Rgure 1 is an overview block diagram showing the 
network of cornputer systems that the facility preferably 
so operates on. 

Rgure 2 is a memory diagram showing selective 
contents of the memories of a database-accessing 
computer system and the database server computer 
system. 

55 Figure 3 is a flow diagram showing the steps prefer- 
ably performed by the facility when a new database- 
accessing program begins executing in a particular 
process. 
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Figure 4 is a flow diagram showing the steps prefer- 
ably performed by the facility when a database-access- 
ing program is being terminated. 

Figure 5 is a flow diagram showing the steps prefer- 
ably performed by the facility in order to obtain a data- 
base object using its object identifier. 

Figure 6 Is a memory diagram showing the results 
of performing step 507. 

Figure 7 is a memory diagram showing the results 
of performing steps 51 0 and 507. 

Figure 8 is a flow diagram showing the steps prefer- 
ably performed by the facility in order to update an 
object and preference by a specified object identifier. 

Figure 9 is a memory diagram showing the results 
of performing step 802. 

Figure 10 is a flow diagram showing the steps pref- 
erably performed by the facility in order to update a 
manipulated object in the datat)ase. 

Figure 1 1 is a memory diagram showing the results 
of performing steps 1001-1005. 

Figure 12 is a flow diagram showing the steps pref- 
erably performed by the database server in order to 
update the version of the manipulated object stored in 
the database to reflect the manipulations to the manipu- 
lated object. 

Figure 13 is a flow diagram showing the steps pref- 
erably performed by the facility on a database-access- 
ing computer system in response to the remote 
procedure call of step 1202. 

Figure 14 is a memory diagram showing the results 
of performing step 1205. 

Figure 15 is a memory diagram showing the results 
of performing step 1010. 

PetailMl Description of the Inventjon 

In accordance with the present invention, a data- 
base object caching facility ("facility') maintains a hier- 
archy of caches that enables database-accessing 
programs processing database transactions to share 
retrieved database objects across database transac- 
tions, reducing process overhead and redundant 
retrieval from database servers, while still maintaining 
read-repeatable transactional isolation. (Read-repeata- 
ble transactional isolation is a property of database sys- 
tems according to which a database accessing program 
can only retrieve committed versions of database 
objects and no other transaction that modifies any of 
such objects is permitted to commit until the database- 
accessing program has completed.) "Database objects" 
as used herein means any unit of data that may be 
retrieved from a database including tables, fields, files, 
programmatic objects, and other units of data. The 
invention addresses the execution of multiple database- 
accessing programs, which each produce a stream of 
related database transactions, each involving one or 
more database operations. 

According to the invention, several database- 



accessing programs are executed in a single process, 
allowing database objects and other data to be shared 
between these programs. In a prefenred embodiment, a 
number of such processes each execute several data- 

5 base-accessing programs. Database-accessing pro- 
grams each have their own program cache for caching 
retrieved database objects. The invention preferably 
maintains the contents of the program cache for subse- 
quent transactions processed by the same program. 

10 which have a significant likelihood of using at least 
some of the objects used by foregoing transactions. 
Another cache, called the process cache, caches all of 
the database objects retrieved by any of the database- 
accessing programs In the process. The process cache, 

IS too, is maintained across individual transactions. When 
a transaction being processed by one of the database- 
accessing programs needs a database object, the facil- 
ity first searches the program cache for the database- 
accessing program and, if the program cache contains 

20 the database objects, the database-accessing program 
accesses it in its program cache. If the database- 
accessing program's program cache does not contain 
the needed database object, the database-accessing 
facility searches the process cache for the needed 

2$ oksject, and. if the process cache contains the needed 
datak>ase object copies it to the program cache where 
the database-accessing program can access It and 
make changes to it without affecting the version of the 
database object used by other database-accessing pro- 

30 grams in the same process. If the needed database 
ot)ject is in neither the program nor the process cache, 
the facility retrieves the needed object from a database 
server and stores the retrieved database object In both 
the process cache and the program cache. In a pre- 
ss fenred embodiment, as part of reading the database 
object from the database and copying to the process 
cache, additional database objects not requested are 
also read and copied to the process cache, allowing 
these additional objects to be quickly obtained from the 

40 process cache if they are needed for a subsequent 
transaction. 

The facility purges any copies of a database object 
from all of these caches if notification is received from 
the database server that the object is being modified by 

45 another program, ensuring that any copies of database 
objects contained by a cache are identical to the data- 
base object stored in the database, which in turn 
ensures read-repeatable transaction isolation. 

Access to the program cache and the process 

50 cache are each protected by a synchronization mecha- 
nism called a lock. As the program cache is accessed 
almost exclusively by its program, however, its program 
can obtain this lock quickly in most cases. Therefore, if 
the program cache is sufficiently large, the time cost of 

55 retrieving an object recently used by the same program 
from tiie program cache is very small. While the time 
cost of retrieving an object recently retrieved by a differ- 
ent program in the same process from the process 
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cache is larger because all of the programs of the proc- 
ess can contend for the lock protecting access to the 
process cache, this time cost is still significantly smajler 
than the time cost of using a database sen/er (which 
often executes on a different machine) to retrieve the 
object from the database. 

Figure 1 1s an overview block diagram showing the 
network of computer systems that the facility preferably 
operates on. it can be seen from Figure 1 that a network 
199 connects a database server computer system 100 
with a number of database-accessing computer sys- 
tems, including database-accessing computer systems 
130 and 160. The database computer system 100 main- 
tains the contents of the database, and services 
requests from all of the database-accessing computer 
systems to retrieve an update object stored in the data- 
base In a preferred embodiment, additional database 
server computer systems are connected to the network 
199 (not shown) having replicated copies of the data- 
base to share the load on the database server computer 
system 100. 

The database server computer system 100 is con- 
nected to the network 1 99 by a network connection 111. 
The database server conrputer system further contains 
an array of central processing units (CPUs) 101, 102, 
103, and 104, having processor caches 106. 107. 108, 
and 109, respectively. The database 121 is stored by 
the database server computer system on a storage 
device 121, such as a redundant array of hard disk 
drives. The database server computer system 100 also 
contains main memory (memory) 112. which in turn 
contains database server programs 113 for accessing 
the database 122. as well as data 1 14 utilized by the 
database server programs 1 13. 

A representative one of the database-accessing 
computer systems is discussed herein. The database- 
accessing computer system 130 is connected to the 
network 199 by a network connection 141. Database- 
accessing computer system 130 contains an anray of 
processors 131, 132, 133, and 134, having processor 
caches 136. 137, 138. and 139, respectively. The data- 
base-accessing computer system further contains main 
memory (memory) 142. The memory 142 in turn con- 
tains a number of process address spaces, including 
process address spaces 145 and 146. The process 
address space 145 fully shown is representative of 
other process address spaces. It contains programs, 
including the facility 147 and database-accessing pro- 
grams, such as database-accessing program A 148 and 
database-accessing program B 149. The process 
address space 145 further includes data 150 used by 
the programs of the process address space. The data- 
base-accessing computer system 130 further includes 
a removable media drive 159, which can be used to 
install software products. Including the facility, which are 
provided on a computer-readable medium, such as a 
CD-ROM. While the facility is preferably implemented 
on the connected group of computer systems config* 



ured as described above, those skilled in the art will rec- 
ognize that it may also be implemented on computer 
systems having different configurations. For example, 
some of the computer systems may not have features 

s shown herein, or may have features not shown herein. 
Further, these features may preferably be embodied in 
other arrangements of computer systems, including 
being an'anged in a single large computer system. Fur- 
ther, the network 199 may be any type of network, 

10 including the Internet, intranets, local area networks, 
wide area networks, and ad hoc systems of connec- 
tions. 

Rgure 2 is a memory diagram showing selective 
contents of the memories of a database-accessing 

15 computer system and the database server computer 
system. Rgure 2 shows a portion of the data 250 in the 
process address space 245 in the memory of the data- 
base-accessing computer system 230. The data 150 
includes a single process cache 281 having a lock 282. 

20 The lacility uses the process cache 281 to cache data- 
base objects retrieved by any of the database-access- 
ing programs executing in the process having process 
address space 245. The data 250 further includes a 
program cache for each data accessing program exe- 

25 cuting in this process: a program A cache 283 having a 
lock 284, and a program B cache 285 having a lock 286. 
Each cache is shown as a table that maps from object 
identifiers to data that comprises the object referenced 
by the object identifier. In each case, the existence of 

30 this data in a cache is a result of retrieving this data from 
the database. The caches shown differ from the actual 
caches preferably used by the facility in a number of 
minor respects to better illustrate their use by the facility. 
Rrst, while object identifiers are shown as four-digit dec- 

35 imal numbers, the facility preferably uses object identifi- 
ers that are 32-digit hexadecimal numbers in order to 
ensure that the object identifiers uniquely identify the 
database objects to which they refer. Also, while the 
data mapped to from each object identifier may be thou- 

Ao sands of bytes long, in each case only one decimal digit 
of the data is shown. Further, while the caches are 
shown as simple tables, they are preferably imple- 
mented as hash tables that hash from object identifiers 
to hash table slots containing pointers to the data com- 

45 prising each object in a variable-length area of the 
cache. The caches each have a limited size, and prefer- 
ably discard data that is old and/or no longer being used 
when their capacity is reached to make room for new 
data. 

so Each lock identifies the one thread currently having 
authority to read or nuxlify the cache to which it con'e- 
sponds. Each lock preferably contains the thread identi- 
fier of the thread having such authority, or a null thread 
identifier when no thread has such authority. The 

55 threads that may have authority to read and modify the 
shown caches include the threads executing each of the 
database-accessing programs executing in the process 
as well as the special thread used by the facility to flush 
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modified database objects from the caches. 

Figure 2 further shows that the process cache 281 
contains the data comprising the object in a compact, 
sequential "streamed" or "fflaf form in which the con- 
tents of the object are stored in the database and conv 
municated across the network, it can further be seen 
that the program caches 283 and 285 contain the data 
comprising each database object in a different "con- 
structed" form, in which the database objects are usable 
by the accessing programs. This difference is discussed 
further below. 

Figure 2 also shows the database 222 stored in the 
storage device 221 of tiie database server computer 
system 200. The database 222 is shown as a table that 
maps object identifiers to the data comprising each 
database object In Its streamed form, In which database 
objects may be transmitted over the network. The data- 
base 222 further contains indications of the read and 
write locks outstanding for each database object which 
con'espond to authority to read and write the database 
object, respectively, by database-accessing programs. 
For each valid object thereafter, the database 222 con- 
tains a row that maps from the object identifier to the 
database object refen-ed to by the object identifier, 
thereby defining the site of database objects in the data- 
base. While the database 222 is shown as a simple 
table, because of its large size, it is preferably imple- 
mented using well-known advantageous indices. It 
should further be noted that the database 222 and the 
caches 281, 283, and 285 are shown with incomplete 
contents to better illustrate their use by the facility. 

It can be seen by comparing program A cache 183 
and program B cache 185 that objects having object 
identifiers "1812," "2217." "8760," and "8991" have 
recently been retrieved from the database by database- 
accessing program A, while database objects having 
object identifiers "4079." "2039." "6213." "7170." and 
"2213" have recently been retrieved from the database 
by database-accessing program B. It can further be 
seen that at least a subset of these objects have been 
stored in the process cache 281. In order to better 
describe the facility, its operation is discussed herein in 
conjunction with the series of examples relating to the 
data 250 shown in Figure 2. 

Figure 3 is a flow diagram showing the steps prefer- 
ably performed by the facility when a new database- 
accessing program begins executing in a particular 
process. These steps are preferably executed in the 
facility by the thread executing the new database- 
accessing program. In step 301. the faciilty creates a 
program cache for the database-accessing program, 
such as program A cache 283 (Rgure 2). In step 302. 
the fadlity creates a lock for the program cache, such as 
program A cache lock 284 (Figure 2). Both steps 301 
and 302 preferably involve allocating a required amount 
of free memory from the process address space for the 
process in which the database-accessing program is 
executing. The facility preferably selects the process in 



which to execute a new database-accessing program in 
order to group within the same process database- 
accessing programs likely to access a common group of 
database objects. These steps then conclude. 

5 Rgure 4 is a flow diagram showing the steps prefer- 
ably performed by the facility when a database-access- 
ing program is being terminated. These steps are 
preferably executed in the facility by the thread execut- 
ing the database-accessing program being terminated. 

10 In step 401, the facility deallocates the memory allo- 
cated to the program cache for the database-accessing 
program. In step 402. the facility deallocates the mem- 
ory allocated to the lock for the database-accessing pro- 
gram. These steps then conclude. 

IS Rgure 5 is a flow diagram showing the steps prefer- 
ably performed by the facility In order to obtain a data- 
base object using its object klentifier. These steps are 
preferably executed in the facility by a thread executing 
one of the database-accessing progranns. In step 501. 

20 the fecility obtains the lock 284 for the program cache 
283. Step 501 preferafc)ly Involves determining whether 
the lock presently contains a null thread identifier and, if 
so, writing the thread identifier with the current thread 
into the lock. If the lock does not currently contain a null 

25 thread identifier, the facility preferably waits until the lock 
contains a null thread identifier. The database-access- 
ing program is executing on one of the processors 131- 
134 of the database-accessing computer system 130 
(Rgure 1). TTie first time the database-accessing pro- 

30 gram obtains the program cache lock, the processor on 
which the database-accessing program is executing 
must retrieve the lock from main memory 142. Retriev- 
ing the lock from main memory 1 42 has a relatively high 
time cost, taking as much time on some processors as 

55 is required to execute about 90 instructions. However, 
as part of retrieving the lock from main memory, the 
processor stores the lock in its processor cache. For 
example, processor 131 would store the lock in its proc- 
essor cache 136. Because the lock is now stored in the 

40 processor cache 136, subsequent attempts to obtain 
the lock by the processor 131 on behalf of this data- 
base-accessing program proceed much more quickly 
because the lock need only be relieved from the proc- 
essor cache 131 rather than main memory 142. and 

45 retrieval from the processor cache in some processors 
takes as little time as that required to execute 3 to 12 
instructions. 

In step 502. after obtaining the lock on the program 
cache, the facility determines whetiier the specified 

50 object identifier is in the program cache. If so. ttie facility 
continues in step 503. else the facility continues in step 
505. In step 503, the specified object identifier and data 
for the database object referenced by tiie object kJenti- 
fier are already contained in the program cache. For 

55 exanrple, program A cache 283 already contains object 
identifier "2217." In step 503. the facility releases the 
lock on the program cache. In step 504, the facility 
returns a pointer to the object in the program cache. 



WEST 



9 



EP0843 267A2 



10 



In step 505, the specified object identifier was not in 
the program cacha For example, object identifiers 
'^039" and "4444" are not contained in the program A 
cache 283 (Figure 2). (n step 505, the facility obtains the 
lock 281 on the process cache 282 (Rgure 2). Like the 
program cache lock, the process cache lock must be 
retrieved from main memory the first time it Is obtained. 
Thereafter It is stored in the processor cache for the 
processor where it can be accessed quickly for subse- 
quent attempts to obtain the lock by the same database* 
accessing program. The lock may. however, be 
migrated to the processor cache of another processor 
when a database-accessing program executing on the 
other processor needs to Interact with the process 
cache. 

In step 506, after obtaining the lock on the process 
cache, the facility determines whether tiie specified 
object identifier is in the process cache. If so. the facility 
continues in step 507. else tiie facility continues in step 
509. In step 507. the specified object Identifier Is in the 
process cache. For example, the object identifier "2039" 
is in the process cache 281 . In step 507. the facility uses 
the streamed form of the object stored In the process 
cache to construct the object in the program cache for 
tiie database-accessing program requesting to obtain 
the object so that the database-accessing program can 
Interact with the database object and its program cache. 

Figure 6 is a memory diagram showing the results 
of performing step 507. K can be seen by comparing 
Figure 6 to Figure 2 that the facility has constructed the 
object referenced by object identifier "2039" in program 
A cache 683. Returning to Rgure 5. In step 508. the 
facility releases tiie lock 682 on the process cache 681 
{Figure 6). After step 508, the facility continues in step 
503. 

In step 509. the specified object Identifier Is in nei- 
ther the program cache for the curent database- 
accessing program nor the process cache, such as 
object identifier "4444." In step 509, the facility obtains a 
read lock on the object having the specified object iden- 
tifier in the database. The database server preferably 
only permits the facility to obtain a read lock on the 
object if there are no outstanding write locks on the 
object. If the object has an outstanding write lock, the 
facility preferably waits in step 509 until tiie outstanding 
write lock is released. In step 51 0, after obtaining a read 
lock on the object in the database, a facility copies the 
sf eamed form of the object from tiie database 622 to 
the process cache 681 (Figure 6). In step 510. the facil- 
ity preferably furtiier copies a body of read-ahead data 
from tiie database containing data comprising addi- 
tional database objects whose object identifiers were 
not specified in tiie object reti'ieval request. This read- 
ahead data is preferably also stored in the process 
cache 681 (Figure 6), so that future requests for the 
objects contained in the read-ahead data may be serv- 
iced from ttie process cache instead of incurring the 
cost of accessing tiie database server computer system 



across tiie network and obtaining a read lock on the 
database object in the database. While steps 509 and 
510 as shown accurately portray tiie overall process of 
obtaining the object having ttie specified object identifier 

5 from the database, this process is further optinvzed in a 
preferred embocfiment to account for the relatively large 
time cost involved In obtaining the object from tiie data- 
base. Because tiiis process takes a relatively, long 
period of time, the facility preferably releases the locks 

10 on the program cache and process cache while the 
reti'ieval from the database is taking place. Therefore, 
between steps 506 and 509. ttie facility releases tiie 
program cache lock and tiie process cache lock (not 
shown). In step 510, instead of copying the streamed 

IS form of the object directiy to the process cache as 
shown, the facility preferably copies the streamed form 
of the object to a temporary location within the process 
address space. Then, between steps 510 and 507, the 
facility obtains tiie process cache lock, copies the 

20 streamed form of tiie object from the temporary location 
to the process cache, and obtains the program cache 
lock (not shown). This optimization allows other data- 
base-accessing programs to use tiie process cache 
during the reti'ievcU operation. The optimization further ■ 

2s allows ttie first steps shown in Figure 13, described in 
detail below, to execute against botii the process and 
* program caches during tiie retrieval operation to 
remove from these cache objects that are about to be 
modified. 

30 After step 510, tiie facility continues in step 507 to 
consti-uct the requested object in tiie program cache 
from tiie streamed form of ttie object copied into the 
process cache. The facility constructs objects when 
they are moved from tiie process cache to a program 

35 cache instead of when they are moved from tiie data- 
base to tiie process cache in part because of the copy- 
ing of unrequested read-ahead data comprising 
additional objects to the process cache as part of 
reb'ieving tiie object from the database. Because 

40 objects are only constructed when moved to a program 
cache, the facility does not incur the overhead of con- 
structing ttie additional objects until a database ti'ansac- 
tion needs the additional objects. Figure 7 is a memory 
diagram showing the results of performing steps 510 

45 and 507. It can be seen by comparing Rgure 7 to Figure 
6 that tiie facility has copied the streamed forms of tiie 
objects referenced by object identifiers "4444," "4723," 
"481 1." and "4813" from the database 722 to the proc- 
ess cache 781. It can furtiier be seen tiiat the facility 

so has constructed the object referenced by object identi- 
fier "4444" in the program A cache 783 using the 
streamed form of the object in ttie process cache 781 . 

In addition to obtaining database objects, data- 
base-accessing programs are preferably also able to 

55 modify database objects. Figure 8 is a flow diagram 
showing the steps preferably performed by the fcicility In 
order to update an object and preference by a specified 
object identifier. These steps are executed In tiie facility 
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by a thread executing the requesting database-access- 
ing progrann. In step 801. the facility uses the specified 
object identifier to ok)tain the object as shown in Figure 
5. After step 801. a current constructed copy of the 
database object to be manipulated Is stored In the pro- 
gram A cache 763, such as the database object refer- 
enced by object identifier "4079." In step 802. the facility 
nfianipulales the object In the program A cache 783. 

Figure 9 is a memory diagram showing the results 
of performing step 802. It can be seen by comparing 
Rgure 9 to Figure 7 that the facility has manipulated the 
object referenced by object identifier "4079" in the pro- 
gram A cache 983 by changing its first digit from a '6' to 
a "5." Returning to Rgure 8, in step 803, the facility 
updates the version of the object stored in the dataksase. 
as well as any remaining cached versions of the object, 
to conform with the manipulations to the object in step 
802. Step 803 is discussed in greater detail below in 
conjunction with Figure 1 0. In step 804, if the object was 
successfully updated in the database, then these steps 
conclude, else the facility continues at step 803 to again 
attempt to update the object in the database. For exam- 
ple, the database server may not have been able to 
obtain a write lock for the database object. 

Figure 10 is a flow diagram showing the steps pref- 
erably performed by the facility in order to update a 
manipulated object in the database In accordance with 
step 803. In steps 1001-1005, the facility loops through 
each cache on a local computer system besides the 
program cache containing the manipulated object. In 
step 1002, the facility obtains a lock on the cun-ent 
cache. In step 1003. If the current cache contains a 
copy of the manipulated object, the facility discards the 
object from the cache. In step 1004, the facility releases 
the lock on the current cache. In step 1005. the facility 
loops back to step 1001 to process the next cache. 

Figure 1 1 is a memory diagram showing the results 
of performing steps 1001-1005. It can be seen by com- 
paring Figure 11 to Figure 9 that the facility has 
removed the database object referenced by object iden* 
tifier "4079" from the process cache 1181, as well as 
from the program B cache 1 185. Note, however, that the 
manipulated object referenced by object identifier 
"4079" is still contained in the program A cache 1 1 83. 

Returning to Figure 10, after all the caches have 
been processed, the facility continues in step 1006. In 
step 1006, the facility creates a streamed version of the 
constructed object in the process cache that was 
manipulated in step 802 (Rgure 8). In step 1007, the 
facility causes the database server on the database 
sender computer system to update the modified object in 
the database. Step 1007 preferably involves performing 
a remote procedure call from the database-accessing 
computer system to the database server computer sys- 
tem using known remote procedure call protocols. The 
remote procedure call preferably passes the streamed 
version of the manipulated object to the database 
server in the database server computer syst m. 



Rgure 12 is a flow diagram showing the steps pref- 
erably performed by the database server in order to 
update the version of the manipulated otsject stored in 
the database to reflect the manipulations to th manipu- 

5 lated object. The manipulated object Is identified by a 
specified object identifier. In steps 1201 -1203. the data- 
base server loops through each computer system hold- 
ing a read lock on the object referenced by the specified 
otsject identifier in the database. In step 1202. the data- 

10 base server causes the facility on the current database- 
accessing computer system to discard from its caches 
any occunrences of the objects referenced by the object 
identifier. Step 1202 preferably involves invoking a 
remote procedure call to the facility of the current data- 

is base-accessing computer system. 

Rgure 13 Is a flow diagram showing the steps pref- 
erably performed by the facility on a database-access- 
ing computer system in response to the remote 
procedure call of step 1202. In steps 1301-1305. the 

20 facility loops through each cache stored on the compu- 
ter system. In step 1302, the facility obtains the lock on 
the current cache. In step 1303, if the object is present 
in the current cache, the facility discards It from the 
cache. In step 1304, the facility releases the lock on the 

25 cun'ent cacha In step 1305, the facility loops back to 
step 1301 to process the next cache on the machine. 
After all of the caches have been processed, the facility 
continues in step 1306. In step 1306, the facility 
releases any read locks it holds on the object In the 

30 database. These steps then conclude. 

Returning to Figure 12, in step 1203, the database 
server loops back to step 1 201 to process the next data- 
base-accesslng computer system. In step 1204, the 
database server obtains a write lock on the object refer- 

35 enced by the object identifier in the database. Step 
1204 preferably involves determining that no read locks 
are outstanding for the database object, which should 
be true after the performance of steps 1 201 -1 203. Step 
1204 further requires that there be no outstanding write 

40 locks on the database object. In step 1205, tiie data- 
base server writes the streamed version of the manipu- 
lated object to the database for the specified object 
identifier. 

Rgure 14 is a memory diagram showing the results 
45 of performing step 1205. It can be seen by comparing 
Figure 14 to Figure 1 1 that the facility has modified the 
object stored in the database for object identifier "4079" 
by changing the first digit from a "6" to a "5." Returning 
to Figure 12, in step 1206. the database server releases 
so the write lock on the object in the database. In step 
1207. tiie database server returns an indication of 
whether the manipulated object was successfully writ- 
ten to the database. Step 1207 preferably involves 
returning the result to ttie facility on the database- 
55 accessing computer system manipulating the object 
using known remote procedure call protocols. 

Returning to Rgure 10, in step 1008, if the attempt 
to update the object in tiie database succeeded, tiien 
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the facility continues in step 1009, else the facility con- 
tinues in step 1012. In step 1009, the attempt to update 
the object In the database succeeded, and the facility 
obtains a lock 1482 on the process cache 1481 (Rgure 
14). In step 101 0. the facility stores the streamed abject 
produced in step 1006 in the process cache 1481 (Fig- 
ure 14). Figure 15 Is a memory diagram showing the 
results of performing step 1010. It can be seen by com- 
paring Figure 15 to Rgure 14 that the fecility has stored 
the manipulated version of the database object in the 
process cache 1581. In step 1011, the facility releases 
the lock 1482 on the process cache 1481. In step 1012, 
the facility retums the result of attempting to update the 
object in the database. 

While the present invention has been shown and 
described with reference to preferred embodiments, it 
will be understood by those skilled in the art that various 
changes or modifications in form and detail may be 
made without departing from the scope of the invention. 
For example, tiie facility may utilize caches at additional 
levels of the cache hierarchy. For example, the facility 
may use a conputer system cache on each datafc)ase- 
accessing computer system to cache objects retrieved 
by any of the programs in any of the processes on the 
computer system. Further, the facility may use program 
group caches to cache objects retrieved by a subset of 
the programs in one process. The facility may further 
use different inplementations of caches and databases 
than shown and discussed herein. Indeed, ttie facility 
may be gainfully applied to any type of transaction 
resource managers having concun'ent distributed cli- 
ents, not just to databases. For example, tiie facility may 
be applied to such transactional resource managers as 
a file system, which processes transactions against f fle 
system objects, and a compound document nnanage- 
ment system, which processes simultaneous transac- 
tions to modify sections of a compound document. 
Additionally, the facility may execute database-access- 
ing programs In any number of processes. Also, the 
database server programs may execute on a data- 
accessing computer system instead of a dedicated 
database server computer system. Furtiier, the facility 
may cache database objects obtained from more than 
one database server, either in a single combined cache 
or In separate, per-sen^er caches. 

Claims 

1 . A computer system for processing multiple streams 
of database transactions in a single process using a 
hierarchy of caches, comprising: 

a processor that executes programs generating 
Uie multiple streams of database transactions 
in tiie process; and 

a memory, of which a portion is devoted exclu- 
sively to the process In which the programs 
generating the multiple streams of database 



transactions are executed, tiie portion of mem- 
ory devoted exclusively to the process contain- 
ing a hierarchy of caches ttiat cache 
infomiation retrieved in accordance with the 
5 streams of database transactions, the hierar- 

chy of caches including: 

a head cache that caches Information 
retrieved in accordance witfi any of the 
10 streams of database transactions, and 

a plurality of leaf caches each caching 
information retrieved in accordance witti 
one of ttie streams of database transac- 
tions, 

IS such ttiat information retrieved in accord- 

ance with one of tiie database transactions 
of one of the streams is stored both in the 
root cache and in the leaf cache for that 
stream, and such tiiat retrieving Infbrma- 

20 tion In accordance witti one of the data- 

base transactions of one of the streams 
includes checking the leaf cache for tiiat 
stream for the information to be retrieved, 
and, if the leaf cache for that stream does 

25 not contain the information to be retrieved, 

cheddng the root cache for tiie information 
to be retrieved. 

2. The computer system of claim 1 wherein, at a given 
30 time, because of tiie limited capacity of the caches, 

some Information earlier cached in one of the 
caches Is no longer contained In tiie cache. 

3. The computer system of claim 1 wherein the hlerar- 
35 chy of caches includes an intermediate cache for a 

group of two or more streams, such that information 
retrieved in accordance with one of the database 
transactions of one of the streams In tiie group is 
also stored in tiie intermediate cache, and such that 

40 retrieving information In accordance witti one of the 
database transactions of one of the streams also 
includes checking the intermediate cache for ttie 
information to be retrieved when the leaf cache for 
that sti'eam does not contain the information to be 

45 retrieved. 

4. The computer system of daim 1 wherein the proc- 
essor also executes one or more additional pro- 
grams generating one or more additional streams 

so of database transactions in a second process, 

and wherein a portion of the memory is 
devoted exclusively to tiie second process. 

and wherein the memory further contains a 
super-root cache that caches information retileved 
55 in accordance with any of the streams of database 
transactions produced by programs executing in 
eittier process, such that information retrieved in 
accordance with one of ttie database transactions 
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of any of the streams of database transactions pro- 
duced by progranns executing in either process is 
also stored in the super-root cache, and such that 
retrieving information in accordance with one of the 
database transactions of any of the streams of 5 
database transactions produced by programs exe- 
cuting in either process also includes checking the 
super-root cache for the information to be retrieved 
when the root cache does not contain the infbnma- 
tion to be retrieved. 

5. The computer system of claim 1 . further comprising 
a cache information invalkiation subsystem that 
receives indications that infbrnnation stored In one 

or more of the caches is no longer valid and, in /5 
response, removes the indicated information from 
the caches, such that information not removed from 
a cache by the cache information invalidation sub- 
system may remain in the cache when one transac- 
tion in a series of transactions completes and the 20 
next transaction in a series of transactions begins. 

6. The computer system of claim 1 wherein the data- 
base transactions each specify an object to be 
retrieved, and wherein information retrieved in 25 
accordance with the database transactions con- 
tains the specified objects, which are retrieved and 
stored in the root cache in a streamed form in which 
the programs cannot interact with the retrieved 
specified objects, 30 

tiie computer system furttier comprising an 
object construction subsystem that converts 
retrieved specified objects in the streamed form into 
a constructed form in which one or more of the pro- 
grams can interact with the retrieved objects and 35 
stores tiie retrieved specified objects each in the 
constructed form in the leaf cache for the stream for 
which the retrieved ol:)ject was retrieved. 

7. The computer system of claim 1 wherein the data- 4o 
base transactions each specity an object to be 
retrieved, and wherein information retrieved in 
accordance with the database transactions con- 
tains the specified objects, which are retrieved and 
stored in ttie root cache in a constructed form in 45 . 
which one or more of the programs can interact with 

the retrieved objects, and wherein the computer 
system furtiier comprises an inter-cache object 
copying subsystem tiiat copies the retrieved speci- 
fied objects in the constructed form from the root 50 
cache to the leaf cache for tiie stream for which the 
retrieved object was retrieved to enable the pro- 
gram generating the stream for which the retrieved 
object was retrieved to interact with the retrieved 
specified objects. 55 

8. The computer system of daim 1 wherein informa- 
tion retrieved and stored in tiie root cache in 



accordance with a first database transaction also 
contains additional objects not specified by the first 
database transaction, and wherein tiie object con- 
struction subsystem services a second database 
transaction specifying one of the additional objects 
by converting the additional object specified by tiie 
second database transaction in the streamed form 
into tiie constructed form and storing the additional 
object specified by tiie second database in the con- 
structed form in the leaf cache for the stream con- 
taining the second database transaction. 

9. A computer memory containing a segmented cach- 
ing data structure for caching database objects pro- 
vided by a database server in response to requests 
by a plurality of programs, the segmented caching 
data structure containing: 

a central cache in which are stored copies of 
database objects provided by the database 
server in response to requests by any of the 
programs; and 

for each program, a program cache in which 
are stored copies of database objects provided 
by the database server in response to requests 
by the program, such that, for a request by one 
of the programs, a copy of the requested object 
stored in either the central cache or the pro- 
gram cache for tiie program may be used, obvi- 
ating provision of the requested database 
object by the database sen/er. 

10. A method in a computer system for processing mul- 
tiple streams of database operations in a single 
process using a hierarchy of caches, the hierarchy 
of caches comprising a single process cache and a 
stream cache for each of the streams, the method 
comprising tiie steps of for each operation in each 
stream: 

identifying a database object required by Uie 
operation; 

determining whether the identified database 
object is contained in tiie stream cache for the 
stream; 

if the identified database object is contained in 
the stream cache for ttie stream, using the 
identified database object contained in tiie 
sti'eam cache for tiie stream; and 
if the identified database object is not con- 
tained in the stream cache for tiie stream: 

determining whether tiie identified data- 
base object is contained in tiie process 
cache, 

if the identified database object is con- 
tained in tiie process cache: 
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copying the identified database object 
from the process cache to the stream 
cache for the stream; and 
using the copy of the Identified data- 
base object in the stream cache for the s 
stream. 

if the identified database object is not con- 
tained in the process cache: 

10 

obtaining the identified database 
object from the database; 
storing the identified database object 
obtained from the database in the 
process cache; is 
copying the identified database object 
from the process cache to the stream 
cache for the stream; and 
using the copy of the identified data- 
base object in the stream cache for the 20 
stream. 

11. The method of claim 10, further comprising the 
steps of: 

25 

receiving an indication that a specified object 
may be changed; 

removing any copies of the specified object 
stored in the process cache from the process 
cache; and 30 
removing any copies of the specified object 
stored in any of the stream caches from the 
stream caches. 

12. The method of claim 10 wherein the obtaining step 35 
includes the step of obtaining from the database 
one or more additional database objects other than 
the identified database object, and wherein the 
storing step includes the step of storing the 
obtained additional database objects in the process 40 
cache. 

13. The method of claim 12, further including the steps 
of: 

45 

identifying one of the additional database 
objects as required by a subsequent operation; 
copying the identified additional database 
object from the process cache for the stream 
cache for the stream of the subsequent opera- so 
tion; and 

using the copy of the additional database 
object in the stream cache for the stream of the 
subsequent operation. 
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